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SUBSCRIBER UNE QUALIFICATION WITH NUMERAL NETWORKS WITH RESPECT TO DATA TRANSMIS- 
SION CHARACTERISTICS 

Background of the Invention 

This application relates generally to communications 
5 networks, and more particularly, to testing communication 
lines. 

Recently, there has been an increased demand for plain 
old telephone systems (POTS ! s) to carry high-speed digital 
signals. The demand has been stimulated by home access to 

10 both the Internet and distant office computers. Both types 
of access typically employ a POTS line as part of the path 
for carrying digital signals. 

POTS ! s lines were built to carry voice signals at 
audible frequencies and can also carry digital signals as 

15 tones in the near audible frequency range. Modern digital 
services such as ISDN and ADSL transmit data at frequencies 
well above the audible range. At these higher frequencies, 
POTS ! s lines that transmit voice signals well may transmit 
digital signals- poorly . Nevertheless, many telephone 

20 operating companies (TELCO's) would like to offer ISDN and/or 
ADSL data services to their subscribers. 

Telephone lines between a TELCO switch and a 
subscriber's premises are frequent sources of poor 
performance at the high frequencies characteristic of ISDN 

25 and ADSL transmissions. Nevertheless, high cost has made 
widespread replacement of these subscriber lines an 
undesirable solution for providing subscribers with lines 
capable of supporting ISDN and ADSL. A less expensive 
alternative would be to remove only those subscriber lines 

30 that are inadequate for transmitting high-speed digital data. 
To enable limited replacement of inadequate lines, 
TELCO's have placed some emphasis on developing methods for 
predicting which subscriber lines will support daua services, 

- 1 - 



WO 01/24490 PCT/US00/25206 



such as ISDN and ADSL. Some emphasis has been also placed on 
predicting frequency ranges at which such data services will 
be supported. Some methods have also been developed for 
finding faults in subscriber lines already supporting data 
5 services so that such faults can be repaired. 

Current methods for predicting the ability of 
subscriber lines to support high-speed digital transmissions 
are typically not automated and labor intensive. Often, 
these methods entail using skilled interpretations of high 

10 frequency measurements of line parameters to determine data 
transmission abilities. At a network scale, such tests are 
very expensive to implement. 

The present invention is directed to overcoming or, at 
least, reducing the affects of one or more of the problems 

15 set forth above. 

Summary of the Invention 

In a first aspect, the invention provides a method of 
testing, a subscriber line. The method includes determining 

20 values of electrical line features from electrical 

measurements on the subscriber line and processing a portion 
of the values of the electrical features with a neural 
network. The neural network predicts whether the line 
qualifies to support one or more preselected data services 

25 from the portion of the values. 

In a second aspect, the invention provides a method of 
constructing a test for qualifying subscriber lines for data 
transmissions. The method includes obtaining electrical 
feature and qualification data for sample lines of a 

30 training set and determining parameters defining a neural 
network from the data of the training set . The network is 
configured to use values of electrical properties of a 
subscriber line to predict whether the subscriber line 
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qualifies for one or more preselected data services. 

In a third aspect, the invention provides a method of 
testing a subscriber line. The method includes determining 
values of electrical features of the line from electrical 
5 measurements on the subscriber line and forming a vector 
having the values as components. The method also includes 
determining whether the vector is a member of a cluster of 
feature vectors and predicting whether the line qualifies for 
a data service based in part on the determination of cluster 

10 membership. Feature vectors of the cluster are associated 
with sample lines of a training set. 

In a fourth aspect, the invention provides a data 
storage medium storing a computer executable program of 
instructions for performing one or more of the above- 

15 described methods. 

Brief Description of the Drawings 

Other features and advantages of the invention will be 
apparent from the following description taken together with 
20 the drawings in which: 

FIG. 1 shows a portion of a POTS network having a 
system for testing subscriber telephone lines; 

FIG. 2 illustrates a device for performing one-ended 
electrical measurements on a subscriber line; 
25 FIG. 3A is a flow chart showing a method of qualifying 

subscriber lines that uses a feed- forward neural network; 

FIG. 3B is a flow chart showing a method of smoothing 
electrical measurements prior to processing by the neural 
network of FIG. 3A; 
3 0 FIG. 4A is a flow chart showing a method of processing 

feature vectors through the neural network of FIG. 3A; 

FIG. 4B is a block diagram showing the flow of data 
through the layers of the neural network of FIG. 4A; 
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FIG. 5 is a flow chart showing a method of constructing 
the neural network of FIGs . 4A-4B; 

FIG. 6 is a flow chart showing a method of determining 
the compression layer of the neural network of FIGs. 4A-4B; 
5 FIG. 7 is a flow chart showing a method of determining 

the basis layer of the neural network of FIGs. 4A-4B; and 

FIG. 8 is a flow chart showing a method of determining 
the weights the output layer of the neural network of FIGs. 
4A-4B. 

10 • Description of the Preferred Embodiments 

MEASUREMENT AND TEST APPARATUS 
FIG. 1 shows a portion of a POTS network 10 that 
includes a system 11 for testing subscriber lines 12-14. 
The subscriber lines 12-14 connect subscriber units 16-18, 

15 i.e., modems and/or telephones, to a telephony switch 15. 
The switch 15 connects the subscriber lines 12-14 to the 
remainder of the telephone network 10. The switch 15 may be 
a POTS switch or another device, e.g., a digital subscriber 
loop access multiplexer (DSLAM) . The switch 15 and testing 

20 system 11 may be located in one or more switching stations 
of a TELCO. 

Each subscriber line 12-14 consists of a standard 
twisted two-wire telephone line adapted to voice 
transmissions . The two wires are generally referred to as 
25 the ring R and tip T wires. 

A large portion of each subscriber line 12-14 is housed 
in one or more standard telephone cables 22. The cable 22 
carries many subscriber lines 12-14, e.g., more than a 
dozen, in a closely packed configuration. The close packing 
3 0 creates an electrical environment that changes transmission 
properties of the individual subscriber lines 12-14. 

A measurement unit 4 0 performs electrical measurements, 
used in tests of the lines 12-14. The measurement unit 40 
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includes a device 43. that performs one-ended electrical 
admittance measurements on tip and ring wires of the lines 
12-14. The measurement unit 40 may also house other devices 
(not shown) for performing other types of one-ended 
5 electrical measurements, e.g., to test the line for selected 
line faults. The measurement unit 40 couples to the switch 
15 via a test bus 42. 

The device 43 connects to the switch 15 through the 
test bus 42 and a standard voice test access 44. The voice 

10 test access 44 electrically connects the device 43 to the 

subscriber lines 12-14 selected for testing. The voice test 
access 44 can transmit electrical signals having frequencies 
between about 100 Hertz (Hz) and 20 kilo Hz (KHz) , i.e., low 
compared to frequencies used by ISDN and ADSL data services. 

15 The measurement unit 40 is controlled by computer 46, 

which selects the types of measurements to perform and the 
subscriber lines 12-14 to test. The computer 46 sends 
control signals to the measurement unit 4 0 and receives 
measurement results from the measurement unit 40 via the 

20 same line 48 . 

The computer 4 6 executes a software program to control 
line testing by the measurement unit 40. The program also 
processes and interprets the results from the measurement 
unit 40 to determine whether to qualify or disqualify the 

25 lines 12-14 for preselected high-speed data services. The 
software program is stored, in executable form, in a data 
storage device 49, e.g., a hard drive or random access 
memory (RAM) . The program may also be encoded on a readable 
storage medium 50, such as an optical or magnetic disk, from 

3 0 which the program can be executed. 

To perform a test, the measurement unit 4 0 signals the 
voice test access 44 to disconnect a selected line 12-14 
from the network 10 and to reconnect the line 12-14 to wires 
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of the bus 42 connecting to the internal device 43. Then, 
the internal device 43 performs one-ended electrical 
measurements on the selected line 12-14. After the 
measurements are completed, the measurement unit 4 0 signals 
5 the switch 15 to reconnect the line 12-14 to the remainder 
of the POTS network 10. 

The computer 46 can qualify or disqualify selected 
subscriber lines 12-14 for data services prior to fully 
connecting the lines 12-14 to the subscriber units 16-18. 

10 Qualification is based on determining, with high certainty, 
that a selected line 12-14 will support a specified data 
service. Disqualification is based on determining, with 
high certainty, that the selected line 12-14 will not 
support the specified data service. 

15 FIG. 2 illustrates the device 43 that performs one- 

ended electrical measurements on the subscriber lines 12-14 
of FIG. 1. The measurements may be used to speed qualify 
each line 12-14 at high frequencies as is described below. 
The measurements can also be used to detect line faults, 

20 such as bridged taps, gauge changes, split pairs, resistive 
imbalances, load coils, metallic faults such as shorts and 
open lines, and capacitive imbalances. Methods for 
detecting such faults with the device 43 have been described 
in U.S. Application No. 09/294,563 C563), filed April 20, 

25 1999 and U.S. Application No. 09/285,954 ('954), filed April 
2, 1999. These applications are incorporated by reference, 
in their entirety, in the present application. 

The device 43 is adapted to measure admittances between 
the tip wire T, ring wire R, and ground G for the subscriber 

30 line 12-14 being tested. The tip and ring wires T, R of the 
line 12-14 being tested couple to driving voltage sources Vi 
and V 2 through known conductances G t and G r . The tip and 
ring wires T, R also connect to voltmeters V t and V r . The V t 
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and V r voltmeters read the voltage between the tip wire T 
and ground G and between the ring wire R and ground G, 
respectively. The readings from the voltmeters V t and V r 
enable the computer 46 to determine three admittances Y tg , 
5 Ytr, and Y rg between the pairs tip-to-ground, tip-to-ring, 

and ring- to -ground, respectively. The device 43 can measure 
the admittances Y tg , Y tr , and Y rg at preselected frequencies 
of the range supported by the voice test access 44 . The 
'563 application has described the steps for performing such 
10 measurements in detail. 

The capacity of a subscriber line to support high-speed 
data transmissions is defined by values of the signal 
attenuation at high frequencies. For ISDN and ADSL services, 
the qualification classes for data transmission are defined 

15 as follows: 

ISDN qualified if attenuation is above -47dB at 100KHZ; 
ADSL qualified if attenuation is above -40dB at 300KHZ; 
Disqualified if attenuation is in another range. 

Other data services may entail different constraints on the 
20 signal attenuation at high frequencies. Nevertheless, 
qualification for any high-speed data service entails 
constraints at higher frequencies than the frequencies 
measured at which the device 4 3 performs electrical 
measurements . 

25 Measuring line properties, e.g., the attenuation, 

directly at the frequencies that define the qualification 
classes is inconvenient, because the voice test access 44 
only supports low frequency measurements. Thus, methods 
have been developed for extrapolating low frequency 

3 0 measurements to predict the high frequency attenuation. 
Those extrapolations are prone to inaccuracies and have 
resulted in mispredictions of line speed qualification 
status . 
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QUALIFICATION WITH NEURAL NETWORKS 

As used herein, a neural network is a process that 
generates, from an input feature vector, a set of confidence 
values that indicate probabilities of class memberships. The 
5 input feature vector has properties of a subscriber line as 
components, e.g., one-ended electrical measurements and 
electrical properties derived therefrom. The various 
embodiments use neural networks in which the classes are 
qualification classes for high-speed data services or 

10 classes of lines with or without selected types of 
conditions or faults. 

FIG. 3A illustrates a method 100 that uses low 
frequency measurements and a feed- forward neural network to 
speed qualify or disqualify the subscriber lines 12-14 for 

15 high-speed data services, such as ISDN and ADSL. The neural 
network can increase the accuracy of qualification tests 
based on low frequency measurements. For example, one 
embodiment that uses a first -order neural network has 
correctly predicted the capacity of unknown subscriber lines 

20 to support ADSL and ISDN with an accuracy of more than 90% 
in test situations. 

To test a selected subscriber line 12-14, the system 11 
evaluates a set of preselected electrical properties for the 
selected line 12-14 (step 101) . The preselected properties 

25 include one-ended electrical measurements made with the 

measurement unit 40 and other properties directly derived 
from such measurements. The measured properties include the 
one-ended admittances Y tg , Y tr , and Y rg , and the derived 
properties include ratios of admittances and capacitances at 

30 preselected frequencies. From the selected electrical 

properties, the computer 4 6 creates an electrical feature 
vector whose ordered components are the values of the 
preselected electrical properties (step 102) . After 
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creating the feature vector, the computer 4 6 processes the 
vector with a feed- forward neural network, which has a 
multi-layered structure (step 103). From an input feature 
vector, the neural network predicts whether the associated 
5 line 12-14 qualifies or disqualifies for a preselected set 
of high-speed data services, e.g., ISDN and/or ADSL. The 
neural network is encoded in a portion of the software 
program, which is stored in the data storage device 49 
and/or the readable medium 50 in a computer- executable form. 

10 Some embodiments combine the method 10 0, based on a 

neural network, with methods for detecting faults to predict 
whether a subscriber line qualifies for high-speed data 
services. In such embodiments, the prediction of whether a 
line is qualified is based both on the predictions of the 

15 neural network and on whether the line has a line fault. 
For example, the existence of a fault may be a quality 
factor that can modify the confidence levels for class 
membership, which are predicted by the neural network. 
Fig. 3B is a flow chart showing a method 104 of 

20 obtaining smoothed electrical properties from one -ended 

measurements on a subscriber line 12-14. The measurement 
unit 40 measures admittances Y tg , Y t r, and/or Y rg of the line 
12-14 at a preselected set of frequencies w f" (step 105). 
One embodiment uses a set of 45 frequencies starting at 150 

25 Hz and having an increment of 45 0 Hz between adjacent 
frequencies . 

The measurements . of the admittances Y tg , Y tr , and/or Y rg 
are susceptible to noise. The noise can corrupt the 
measurements enough to interfere with results obtained from 
30 the neural network. To reduce noise interference, the 

computer 46 smoothes the measured admittances (step 106) . 

To implement smoothing, the computer 4 6 expands each 
measured admittance U Y S " with a polynomial B s (f) of the 
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form: 

B s (f) = E D j=o 8 S jPj(f) • 
In the expansion, the Pj(f)'s form an orthogonal polynomial 
basis, e.g., Legendre polynomials. For measurements at the 
5 above -described forty-five frequencies, an expansion with 
the eight lowest Pj(f)'s can produce adequate smoothing, 
i.e., D=8. The computer 46 determines the coefficients, 8j, 
from the measured Y 8 (f)'s using the formula: 

5 S j = <Y s (f) , Pj(f)>. 

10 Here, <,> is the inner product for which the Pj(f)'s are 
othonormal. The values of the polynomials B s (f) at a 
preselected set of frequencies w f" define the measured 
impedances of the line 12-14, which are inputted into the 
neural network. In the polynomial B s (f), noise effects are 

15 reduced by averaging. 

The computer 4 6 calculates derived electrical 
properties from the polynomial B s (f) instead of directly 
from measurements (step 107) . The derived properties may 
include capacitances, ratios of admittances and derived 

2 0 properties at fixed frequencies, ratios of admittances to 

frequency, and peaks and valleys of the frequency-dependent 
admittances. Again, the use of the smoothing polynomial, 
B s (f), to calculate these properties reduces noise effects. 
In step 102 of FIG. 3A, the feature vector for the 

25 subscriber line 12-14 is a concatenation of measured and 

derived electrical properties that have been determined from 
B s (f) . 

FIG. 4A is a flow chart 110 illustrating step 103 of 
FIG. 3A in which the computer 46 processes the feature 
30 vector to determine the qualification status of the 

associated line 12-14. The processing occurs sequentially 
in three layers of the neural network 115 shown in FIG. 4B. 
In the first or compression layer 116 of the neural 
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network 115, the computer 46 compresses a N-dimensional 
feature vector Z a by performing a projection into a 
preselected M-dimensional subspace of the entire N- 
dimensional feature space (step 112) . To perform the 
5 projection, the computer 46 evaluates scalar products 
between the feature vector Z a and a special set of 
orthonormal basis vectors spanning the subspace. Since the 
subspace has a lower dimension (M < N) , the projection 
produces a compressed feature vector X a with less components 

10 than the input feature vector Z a thereby making subsequent 

processing more rapid. The projection removes components of 
the feature vectors that do not vary substantially over 
sample lines of a training set. Such components are less 
indicative of the qualification status of the associated 

15 lines, because they have similar values for both qualified 
and disqualified lines. 

In the second or basis layer 117 of the neural network 
115, the computer 46 determines whether the compressed 
vector X a is a member of one or more clusters of feature 

20 vectors located -in the projected subspace (step 113) . To 
determine cluster memberships, the computer 46 evaluates 
fuzzy variables whose values represent probabilities of 
belonging to the clusters associated with the variables. 

The clusters are localized groups of compressed feature 

25 vectors associated with the sample lines of the training 

set. Each cluster is located in the projected subspace of 
the entire feature space and can be modeled as a hyper- 
ellipsoidal region having a fuzzy boundary. Fuzzy functions 
on the clusters form the basis layer 117 of the neural 

3 0 network 115. 

For a cluster K, the associated fuzzy variable y K is a 
fuzzy Horn clause. The value y K of the fuzzy Horn clause 
for a projected feature vector X is given by: 
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y K (X) = exp[-(X - V K ) t D K 2 (X - V K )] 
V K is the location of the center of the cluster K in the 
projected feature vector space. D K 2 is a diagonal matrix 
whose entries, cm" 2 , are the squared widths of the hyper- 
5 ellipsoidal region along directions "i" . The em's express 
the fuzziness of the cluster K. The value y K (X) is 
indicative of the probability that the line 12-14 with 
projected feature vector X is a member of the cluster K. 

In the last or output layer 118 of the neural network 

10 115, the computer 4 6 predicts the selected line's membership 
to each qualification class (step 114) . The class 
membership predictions produce confidence values C Q for 
membership to each qualification class, Q, recognized by the 
neural network- The confidence value C Q is defined by a 

15 class membership function whose form depends on an rxC 
matrix of weight parameters, W Q . C is the number of 
clusters in the class Q, and r is a fixed non-negative 
integer that defines how results are biased, e.g., r = 0 or 
1. For class Q, the confidence value C Q is given by: 

20 C Q (X) « {l + exp[-(X r , IT W Q (Y(X) )] }~\ 

Here, Y(X) is the vector [yi(X), y c (X), l] t of the fuzzy 

variables representing the probabilities that the projected 
feature vector X is a member of the C clusters of the class 
Q. X r is a dimensionally reduced version of the projected 

25 feature vector X in which only the "r" most significant 
terms are retained. 

The confidence values Cq's express a relationship 
between cluster membership and class membership. The 
relationship quantifies the face that projected feature 

30 vectors of a single cluster have similar qualification class 
memberships . 

For each feature vector, the neural network 115 
generates a set of confidence values that indicate the 
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probability of membership to each qualification class at the 
output 119. The computer 46 reports that the associated 
subscriber line 12-14 belongs to the qualification class for 
which the confidence value is highest. 
5 The form of the neural network 115 is fixed by the 

parameters defining each of the three layers 116-118. The 
parameters define the feature space projection of the 
compression layer 116, the cluster geometry of the basis 
layer 117, and the weights of the output layer 118, which 

10 relate memberships to clusters and qualification classes. 

FIG. 5 illustrates a method 120 for constructing the 
feed-forward neural network 115 used in FIGs. 3A, 4A, and 
4B. To construct the neural network 115, data for a 
training set of sample lines is acquired through direct 

15 measurements and/or calculations (step 122) . The data 
includes both an electrical feature vector and a 
qualification classification for each sample line. The 
properties of the sample lines belonging to the training set 
may correspond to the expected line properties for the 

2 0 -unknown subscriber lines to be tested with the neural 

network 115. When such a correspondence exists, the neural 
network is generally expected to predict line qualification 
status with a higher accuracy. From the training set, the 
computer 4 6 determines the parameters that define the neural 
25 network 115 by a learning process (step 124) . 

Localizing both capacities to create and to use the 
neural network 115 on the same computer 4 6 leads to some 
advantages. Then, the system 11 is more adaptable to 
changes to the general population of subscriber lines 12-14. 

3 0 In response to changes, the computer 4 6 re -adapts the neural 

network 115 by recalculating the network parameters with a 
training set whose properties are more aligned with the 
changed population of subscriber lines 12-14. 
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FIG. 6 is a flow chart illustrating a method 125 of 
determining the parameters that define the compression layer 
116 of the neural network 115. The projection, which 
produces the compression, employs a Karhunen-Loeve (KL) 
5 transformation. The KL transformation provides an expansion 
of feature vectors with a special orthonormal basis of N 
vectors, cpi. In this basis, an arbitrary feature vector, X, 
is written as: 

X = I N i= i k i( pi + <X>. 
10 Here, <> denotes an average over the sample lines in the 
training set, i.e., <X> is the average of the feature 
vector. 

Each basis vector, q>i, is an eigenvector of a 
covariance matrix R, which is defined as an average over the 
15 feature vectors of the training set. The matrix R is: 

R = < (X-<X>) (X-<X>) t >. 
The basis vectors q>i satisfy the equations: 

R<pi = A,i(pi 
in -which the Ju are the eigenvalues. 
20 To determine the parameters of the compression layer 

116, the computer 46 evaluates the covariance matrix R on 
the feature vectors of the training set (step 126) . Then, 
the computer 46 determines the eigenvalues and eigenvectors 
of the covariance matrix R (step 127) . 
25 In terms of the expansion over the special basis {<p±}, 

the compression layer 116 compresses a feature vector X by 
simply deleting components of vector X to produce a 
truncated vector, X P . The truncated vector X P is given by: 

X P = = Z M i=i kicpi. 

3 0 The truncation deletes the terras of the expansion of X over 
(pi's corresponding to small eigenvalues Xi of the covariance 
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matrix R. The retained terms correspond to (pi's whose 
eigenvalues Ju are greater than a preselected threshold, 
e.g., max(Ju)/250 (step 128). The coefficients (ki, k 2 ,...,k M ) 
for the retained terms are given by: 

5 (ki, k 2 ,..., k M ) = (<pi # <p2r-r (pM) t .X. 

By only retaining M eigenvectors associated with the M 
largest eigenvalues, the dimension of the feature vectors is 
reduced by compression from N to M. The eigenvectors (pi, 
for i = 1, M, define the compression layer 116. 
10 The truncation of the feature vector X generates an 

error vector "e" , which is given by: 

e = Z N i=M+i ki<pi. 

The error vector *e" , corresponds to a sum of squares error, 
e, which has the form: 
15 e - Z N i= M+ i kiki = Z N i= M+ i ( pi t (X-<X>) (X-<X>) t ( p i 

Thus, the average of the error g over the training set is: 

<6> = Z i=M+l (pi ri (pi = 2, i=M+l Or - 

Since the Ju appearing in <s> are the smallest eigenvalues 
of R, the feature space projection of step 112 is optimal. 

20 The feature space projection reduces the size of feature 
vectors and generates minimal "average" errors. For 
example, one embodiment that truncates eigenvectors having 
eigenvalues smaller than max(Ju)/250 removes about two 
thirds of the components of feature vectors. 

25 FIG. 7 is a flow chart illustrating a method 130 of 

determining the parameters that define the basis layer 117 
of clusters in projected feature space. The method uses 
fuzzy C-means clustering to recursively determine the basis 
layer from the data in the training set. At each recursion, 

30 both the center location V a of each cluster "a" and the 

membership values are updated by minimizing a fuzzy cost 
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function, E, over the data in the training set. Minimizing 
E reduces inter-cluster overlaps by grouping together 
projected feature vectors based on both geometry and a 
relation. The relation assigns each feature vector of the 
5 training set to the clusters of a single qualification 
class, i.e., to the class to which the associated sample 
line belongs. Thus, each feature vector of the training set 
is used to construct the clusters of one qualification 
class . 

10 For each class, the fuzzy cost function E provides a 

fuzzy measure of clustering. Each fuzzy cost function E is 
defined by: 

where the sum runs over the U C" clusters of the associated 
15 qualification class. For each projected feature vector Xm, 
the fuzzy variables ^ ma are indicative of membership to 
cluster "a". When summed over the C clusters of the class to 
which the associated sample line belongs, the ^ a add up to 

one , i.e., £^a=i jima = 1 • 

20 To find the parameters defining the functions of the 

basis layer 117, the computer 46 searches for clusters in the 
distribution of projected feature vectors for the training 
set. Before starting a search, the computer 46 receives a 
value for the number C of clusters in a qualification class, 

25 e.g., a guessed value of C provided by an operator (step 

132) . After receiving the number of clusters, the computer 
46 guesses the center location V a for each cluster "a" of the 
class (step 134) . From the guessed center locations, the 
computer 4 6 calculates values for cluster membership 

3 0 variables associated with each feature vector X m of a 
sample line in the same class (step 13 6) . The cluster 
membership variables are found from the center locations, by 
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solving the equations: 

(^ar 1 = Z C a-l C{|X m - V a |/|X m - V a > | } ] ^ - 

In which "fuzz" is a preselected fuzziness parameter. Fuzz 
is greater than 1, e.g., fuzz may be between 1.6 and 1.7. 
5 Using the cluster membership variables ^ma's, the 

computer 46 updates the center locations, i.e., the V a 's, of 
the clusters (step 138) . The updated center locations are 
determined from the equation: 

V a = [^(^irna)' 1122 X m ] / [ l»> (|*'«) *""] 

10 The sums are over projected feature vectors Xm and X ra ' of the 
training set. Next, the computer 46 determines whether 
distances between the present and last values of the center 
locations of the clusters are small enough, i.e., less than a 
preselected threshold value (step 140) . If any differences 

15 exceed the threshold value, the computer 4 6 executes loop 142 
to start another cycle of updating the geometric variables V a 
and jima, which define the clusters. 

If all the distances between present are below the 
threshold value, the computer 4 6 calculates the widths of 

20 each cluster (step 142). Since each cluster "a" is a hyper- 
ellipsoidal region, the width <D ab of cluster u a" along 
direction xx b" is related to the range of projected feature 
vectors belonging to the cluster "a" along the direction u b" . 
One definition of the width 0 ab is given by: 

25 Oab = Zra(fima m|X m , b " V a , b| 2 ] 1/2 

Here, the component of a vector Z along direction "b" is 
indicated by Z, b . 

FIG. 8 is a flow chart illustrating an iterative method 
150 of determining the weights W 0 in the class membership 
30 functions C Q (X) . The method is based on a Gaussian noise 

model for the learning process in which the N-the recursive 
estimate for the confidence value Zi is given by: 
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Zi(N) = fi(N) + *l 
The model's noise, *i is Gaussian and has zero mean, i.e., 

<*i> = 0, and a variance <*i*i> = Zi 2 - 

The weight parameters W Q are determined by using an 
5 extended Kalman estimator. The extended Kalman estimator 
approximates the nonlinear function f i (N) of the weights W Q 
by linearizing the dependence on W Q to obtain the form: 

Zi(N) = E ca Hi, a (N)W a (N) + *i with Hi, a (N) = Mf l (N) /M W* (N) . 
Henceforth, the index "a" includes all matrix- indices of the 
10 weight matrix VI* (N) . W 3 (N) is the Nth recursive estimate to 
the optimal weight matrix W 3 (oo) . 

To determine the weight matrix, the computer 46 employs 
a recursive algorithm. The computer 46 initializes the 
weights and the covariance, P(N), for the error, Er(N) , in 
15 the weights, i.e., Er(N) = W a (N) - W a (oo), over the training 
set (step 152) . For the Nth estimate, the error in the 
weights is: Er(N) = W a (N) - W a (oo) . The initial values or 
the 0th estimate can be chosen to be: P(0) = 10 3 I and W a (0) 
= 0 where I is the unit matrix on the space of weight 
20 matrices W a . 

The computer 4 6 calculates new or Nth estimates of 
K a (N), Pi(N), W a (N), fi(N), and Hi, a (N) using the last or (N- 
l)th estimates to these quantities (step 154). At the Nth 
iteration, the calculation entails solving the following 
25 equations sequentially: 

K a (N) = E cb Pi, ab (N-l) Hi, b (N-l) V (Zi 2 +Hi (N-l) Pi (N-l) Hi (N-l) l ) , 
Pi(N) = [I - *ca Ka(N)H(N)i, a ]Pi(N-l) , 
W^N) = W^N-l) + Ka(N)[Mi(X) - fi(N-l)], 
f i (N) = f i (W a (N) ) , and 
30 Hi, a (N) = Mfi(N)/M W a (N) . 

The update is performed for each vector X in the training 
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set. Mi(X) is the actual class membership value of X in 
class "1", i.e., 1 or 0. After calculating the Nth 
approximation for each feature vector, the computer 
calculates the squared error in the weight vectors averaged 
5 over the training set, i.e., <tr[Pi(N)]> (step 156). The 
computer 46 determines whether the error, i.e., <tr[Pi(N)]>, 
is less than a preselected value (step 158) . If the error is 
below the preselected value, the computer 46 records the 
value of the weight matrix as the form to be used in the 

10 neural network 115 (step 160) . Otherwise, the computer 46 

performs a loop 162 to calculate the (N+l)th estimated, i.e., 
Ka(N+l), Pi(N+l), W a (N+l), fi(N+l), and Hi. a (N+l) . 

Other embodiments are within the scope of the 
following claims. 

15 What is claimed is: 
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1 i . A method of testing a subscriber line, comprising: 

2 determining values of electrical line features from 

3 electrical measurements on the subscriber line; and 

4 processing a portion of the values of the electrical 

5 features with a neural network that predicts whether the line 

6 qualifies to support one or more preselected data services 

7 from the portion of the values. 

1 2. The method of claim 1, wherein a line qualifies 

2 for one of the data services if the line attenuates data at a 

3 preselected frequency by less than a preselected amount. 

1 3. The method of claim 2, wherein the preselected 

2 frequency is substantially higher than the frequencies of the 

3 electrical measurements. 

1 4* The method of claim 1, wherein the measurements 

2 are one-ended electrical measurements. 

1 5. The method of claim 4, wherein the measurements 

2 are performed through a test access of a switch. 

1 6. The method of claim 4, further comprising 

2 compressing a vector whose components are the 

3 electrical features into a subspace; and 

4 predicting whether the line qualifies based on the 

5 compressed vector. 

1 7. The method of claim 6, wherein variances of 

2 lengths of components of a set of sample lines have smaller 

3 values in an orthogonal complement of the subspace than in 
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4 the subspace. 



1 8. The method of claim 4, wherein the processing 

2 comprises: 

3 determining whether the vector belongs to a cluster of 

4 feature vectors, each feature vector of the cluster being 

5 associated with a sample line of a training set. 

1 9. The method of claim 8, wherein the vector belongs 

2 to the cluster in response to being within a geometrical 

3 region defined by the feature vectors of the training set 

4 that belong to the cluster. 

1 10. The method of claim 8, further comprising: 

2 • predicting that the subscriber line is qualified for a 

3 particular data service in response to the vector belonging 

4 to one of the clusters defined by feature vectors of sample 

5 lines that are qualified for the particular data service. 

1 11. The method of claim 8, further comprising: 

2 predicting that the subscriber line is disqualified for 

3 a particular data service in response to the vector belonging 

4 to one of the clusters defined by feature vectors of sample 

5 lines that are disqualified for the particular data service. 

1 12. The method of claim 2, wherein the data services 

2 comprise one of ISDN and ADSL . 

1 13. The method of claim 2, further comprising: 

2 determining whether the subscriber line has a line fault. 

1 14. The method of claim 14, further comprising: 

2 predicting whether the line qualifies for one of the 
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3 selected data services, the prediction based in part on 

4 whether the line has a line fault. 

1 15. The method of claim 14, wherein the line fault 

2 includes one of a bridged tap, a gauge change, a split pair, 

3 a resistive imbalance, a load coil, a metallic fault, and a 

4 capacitive imbalance. 

1 16. A method of constructing a test for qualifying 

2 subscriber lines for data transmissions, comprising: 

3 obtaining electrical feature and qualification data for 

4 sample lines of a training set; 

5 determining parameters defining a neural network from 

6 the data of the training set, the network is configured to 

7 use values of electrical properties of a subscriber line to 

8 predict whether the subscriber line qualifies for one or more 

9 preselected data services . 

1 17. The method of claim 17, wherein a line qualifies 

2 for one of the data services if the line attenuates data at a 

3 preselected frequency by less than a preselected amount 

1 18. The method of claim 17, wherein the electrical 

2 features include one -ended electrical measurements and at 

3 least one property calculated from the one-ended 

4 measurements . 

1 19. The method of claim 19, wherein the determining 

2 parameters comprises learning a portion of the parameters 

3 from the training set. 

1 20. The method of claim 19, wherein the determining 

2 comprises finding a projected subspace in a space of vectors, 



- 22 - 



WO 01/24490 



PCT/US00/25206 



3 components of each vector being the electrical features of an 

4 associated one of the sample lines. 



1 21. The method of claim 21, wherein a portion of the 

2 components of the vectors in the projected subspace have 

3 larger variances over the training set than components of the 

4 vectors orthogonal to the subspace. 

1 22. The method of claim 21, wherein the determining 

2 comprises identifying clusters in a distribution of the 

3 vectors in the projected subspace. 

1 23. The method of claim 19, wherein the determining 

2 comprises identifying clusters in a distribution of vectors, 

3 each vector corresponding to a list of electrical properties 

4 of one of the sample lines of the training set. 

1 24. The method of claim 24, wherein identifying 

2 includes determining parameters for cluster sizes and 

3 locations. 

1 25. The method of claim 24, wherein the determining 

2 further comprises: 

3 finding weights that correlate membership in a cluster 

4 to qualification for one of the preselected data services. 

1 26. The method of claim 18, wherein one of the 

2 preselected data services is one of ADSL and ISDN. 

1 27. A data storage medium storing a computer 

2 executable program for a method of testing a subscriber line, 

3 the method comprising: 

4 determining values of electrical line features from 
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5 electrical measurements on the subscriber line; and 

6 processing a portion of the values of the electrical 

7 features with a neural network that predicts whether the line 

8 qualifies to support one or more preselected data services 

9 from the portion of the values. 

1 28. The medium of claim 28, wherein a line qualifies 

2 for one of the data services if the line attenuates data at a 

3 preselected frequency by less than a preselected amount . 

1 29. The medium of claim 28, wherein the preselected 

2 frequency is substantially higher than the frequencies of the 

3 electrical measurements. 

1 30. The method of claim 28, wherein the measurements 

2 are one-ended electrical measurements. 

1 31. The medium of claim 4, the method further 

2 comprising 

3 compressing a vector whose components are the 

4 electrical features into a subspace; and 

5 predicting whether the line qualifies based on the 

6 compressed vector. 

1 32. The medium of claim 32, wherein variances of 

2 lengths of components of a set of sample lines have smaller 

3 values in an orthogonal complement of the subspace than in 

4 the subspace. 

1 33. The medium of claim 30, wherein the processing 

2 comprises : 

3 determining whether the vector belongs to a cluster of 

4 feature vectors, each feature vector of the cluster being 
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5 associated with a sample line of a training set. 



1 34. The medium of claim 34, wherein the vector belongs 

2 to the cluster in response to being within a geometrical 

3 region defined by the feature vectors of the training set 

4 that belong to the cluster. 

1 35. The medium of claim 34, the method further 

2 comprising: 

3 predicting that the subscriber line is qualified for a 

4 particular data service in response to the vector belonging 

5 to one of the clusters defined by feature vectors of sample 

6 lines that are qualified for the particular data service. 

1 36. The medium of claim 34, the method further 

2 comprising: 

3 predicting that the subscriber line is disqualified for 

4 a particular data service in response to the vector belonging 

5 to one of the clusters defined by feature vectors of sample 

6 lines that are disqualified for the particular data service. 

1 37. The medium of claim 28, wherein the one or more 

2 data services comprise one of ISDN and ADSL. 

1 38. The medium of claim 29, further comprising: 

2 determining whether the subscriber line has a line fault. 

1 3 9. A method of testing a subscriber line, comprising: 

2 determining values of electrical features of the line 

3 from electrical measurements on the subscriber line; and 

4 forming a vector having the values as components; 

5 determining whether the vector is a member of a cluster 

6 of feature vectors, feature vectors of the cluster being 
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7 associated with a sample lines of a training set; and 

8 predicting whether the line qualifies for a data 

9 service based in part on the determination of cluster 
10 membership. 

1 40. The method of claim 40, further comprising: 

2 projecting out a portion of the components of the 

3 vector having features of the subscriber line as components; 

4 and 

5 wherein the act of determining membership is based on 

6 the compressed vector. 

1 41. The method of claim 40, wherein a line qualifies 

2 for the data service if the line attenuates data signals at a 

3 preselected frequency by less than a preselected amount. 

1 42. The method of claim 42, wherein the preselected 

2 frequency is substantially higher than the frequencies of the 

3 electrical measurements. 

1 43. The method of claim 40, wherein the measurements 

2 are one-ended electrical measurements. 

1 44. The method of claim 40, wherein the vector belongs 

2 to the cluster in response to being within a geometrical 

3 region defined by the feature vectors of the training set 

4 that belong to the cluster. 

1 45. The method of claim 2, wherein the data service is 

2 one of ISDN and ADSL. 
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